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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Bandman, Olga 

Guegler, Karl J. 
Lai, Preeti 

(ii) TITLE OF THE INVENTION: SH3 -CONTAINING PROTEINS 



(iii) NUMBER OF SEQUENCES : 5 

( iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

(B) STREET: 3174 Porter Dr. 

(C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP : 94304 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: Filed Herewith 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Billings, Lucy J. 

(B) REGISTRATION NUMBER: 36,749 

(C) REFERENCE /DOCKET NUMBER: PF-0419 US 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 650-855-0555 

(B) TELEFAX: 65 0-845-416 6 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 65 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(vii) IMMEDIATE SOURCE: 

(A) LI BRARY : BRA I TUT 0 3 

(B) CLONE: 865744 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Met Lys Arg Thr Val Ser Asp Asn Ser Leu Ser Asn Ser Arg Gly Glu 

1 5 10 15 

Gly Lys Pro Asp Leu Lys Phe Gly Gly Lys Ser Lys Gly Lys Leu Trp 

20 25 30 

Pro Phe lie Lys Lys Asn Lys Gly Ala Thr Pro Glu Asp Phe Ser Asn 

35 40 45 

Leu Pro Pro Glu Gin Arg Arg Lys Lys Leu Gin Gin Lys Val Asp Glu 
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-4J 
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60 










Leu 


Asn 


Lys 


Glu 


He 


Gin 


Lys 


Glu 


Met 


Asp 


Gin 


Arg 


Asp 


Ala 


He 


Thr 


65 










70 










75 










80 


Lys 


Met 


Lys 


Asp 


Val 


Tyr 


Leu 


Lys 


Asn 


Pro 


Gin 


Met 


Gly 


Asp 


Pro 


Ala 










85 










90 








95 




Ser 


Leu 


Asp 


His 


Lys 


Leu 


Ala 


Glu 


Val 


Ser 


Gin 


Asn 


He 


Glu 


Lys 


Leu 








100 










105 










110 




Arg 


Val 


Glu 


Thr 


Gin 


Lys 


Phe 


Glu 


Ala 


Trp 


Leu 


Ala 


Glu 


Val 


Glu 


Gly 






115 










120 










125 






Arg 


Leu 


Pro 


Ala 


Arg 


Asn 


Glu 


Gin 


Ala 


Arg 


Arg 


Gin 


Ser 


Gly 


Leu 


Tyr 




130 










135 










140 






Asp 


Ser 


Gin 


Asn 


Pro 


Pro 


Thr 


Val 


Asn 


Asn 


Cys 


Ala 


Gin 


Asp 


Arg 


Glu 


145 










150 










155 






160 


Ser 


Pro 


Asp 


Gly 


Ser 
165 


Tyr 


Thr 


Glu 


Glu 


Gin 
170 


Ser 


Gin 


Glu 


Ser 


Glu 
175 


Met 


Lys 


Val 


Leu 


Ala 
180 


Thr 


Asp 


Phe 


Asp 


Asp 
185 


Glu 


Phe 


Asp 


Asp 


Glu 
190 


Glu 


Pro 


Leu 


Pro 


Ala 


He 


Gly 


Thr 


Cys 


Lys 


Ala 


Leu 


Tyr 


Thr 


Phe 


Glu 


Gly 


Gin 
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Glu 


Gly 


Thr 


He 


Ser 


Val 


Val 


Glu 


Gly 


Glu 


Thr 


Leu 


Tyr 


Val 


He 
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220 








Glu 


Glu 


Asp 


Lys 


Gly 


Asp 


Gly 


Trp 


Thr 


Arg 


He 


Arg 


Arg 


Asn 


Glu 


Asp 


225 










230 










235 










240 


Glu 


Glu 


Gly 


Tyr 


Val 


Pro 


Thr 


Ser 


Tyr 


Val 


Glu 


Val 


Cys 


Leu 


Asp 


Lys 










245 










250 










255 
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L ys 


Gly 
260 


Ala 
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Thr 


Tyr 


He 
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(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1459 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY : BRAITUT03 

(B) CLONE: 865744 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

AGTAAAAGCA GCCGAATCAA TTGATCAGAA AAATGATTCA CAGCTGGTAA TAGAAGCTTA 60 

TAAATCAGGG TTTGAGCCTC C TGG AG AC AT TGAATTTGAG GATTACACTC AGCCAATGAA 12 0 

GCGCACTGTG TCAGATAACA GCCTTTCAAA TTCCAGAGGA GAAGGCAAAC CAGACCTCAA 180 

ATTTGGTGGC AAATCCAAAG GAAAGTTATG GCCGTTCATC AAAAAAAATA AGGGTGCAAC 2 40 

ACCGGAGGAT TTCAGCAACC TCCCACCTGA ACAAAGAAGG AAAAAGCTGC AGCAGAAAGT 3 00 

CGATGAGTTA AATAAAGAAA TTCAGAAGGA GATGGATCAA AGAGATGCCA TAACAAAAAT 3 60 

GAAAGATGTC TACCTAAAGA ATCCTCAGAT GGG AG AC C C A GCCAGTTTGG ATCACAAATT 42 0 

AGCAGAAGTC AGCCAAAATA TAGAGAAACT GCGAGTAGAG ACCCAGAAAT TTGAGGCCTG 4 80 

GCTGGCTGAG GTTGAAGGCC GGCTCCCAGC ACGCAACGAG CAGGCGCGCC GGCAGAGCGG 540 

ACTGTACGAC AGCCAGAACC CACCCACAGT CAACAACTGC GCCCAGGACC GTGAGAGCCC 6 00 

AGATGGCAGT TACACAGAGG AGCAGAGTCA GGAGAGTGAG ATGAAGGTGC TGGCCACGGA 6 60 

TTTTGACGAC GAGTTTGATG ATGAGGAGCC CCTCCCTGCC ATAGGGACGT GCAAAGCTCT 72 0 

CTACACATTT GAAGGTCAGA ATGAAGGAAC GATTTCCGTA GTTGAAGGAG AAACATTGTA 7 80 

TGTCATAGAG GAAGACAAAG GCGATGGCTG GACCCGCATT CGGAGAAATG AAGATGAAGA 840 

GGGTTATGTC CCCACTTCAT ATGTCGAAGT CTGTTTGGAC AAAAATGCCA AAGGTGCTAA 900 

GACTTATATT TAATACCATA AAAAAAAAAA ACTTAAAAAA AATGGAGTTG TTTCTCCCCA 9 60 

CAACCGTGAC TGTTACAGGC AGTTCCTCAA GAGACTGGCT GGCAAGCACC ATAATGCACG 102 0 

TTCTCCTGTA GTCTCACGTG GACTTCAGGG TCCGGGCACC TGAATTGCCT TGTCTAGTTT 10 80 

GGGCTGTAAT CAAGTTTCAC TTGCTGATGA AATTTTATGT GGAAAGCTGC CAACCGCCAA 1140 

CTTACAGCTA TGTCATTCAA AATC TGATAA ACATTTCTTC TTTTGGCGGT ATCTGTAGAT 12 00 

TAAAAAAAAA GTTGCATTGT AGCTTCTCAT CTTTCTGAAT TTAAAAGCCG GCACGCATCA 12 60 

TGCAGGTGCC AAAGACTTCC CTACTCTTGT TTATATCTAG TATCCACCAT ACACTGAGCT 132 0 

ACATTAGGTG GTTACAGATT GTAACTTAAT AAACTGAACT GTGTTAGTTT GTTAAATTGG 13 80 

ATACTCATTC ACTTGGGGAG GAGTCACAAG TGAAAT AC C A TCTCTTTCTT GACTAAAGCG 1440 
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GTAAATAAGG TTCTTATTG 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

( A ) LI BRARY : PRO SNOT 2 0 

(B) CLONE: 1816529 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Met Lys Asp Val Tyr Glu Lys Thr Pro Gin Met Gly Asp Pro Ala Ser 

5 10 15 

Leu Glu Pro Gin He Ala Glu Thr Leu Ser Asn He Glu Arg Leu Lys 

20 25 30 

Leu Glu Val Gin Lys Tyr Glu Ala Trp Leu Ala Glu Ala Glu Ser Arg 
M 35 40 45 y 

*? Val ^ U Ser ASn Arg Gly Asp Ser Leu Ser Ar ^ His Ala Arg Pro Pro 

yd 55 60 

FU ^ a Pro Pro Ala Ser Ala p ro Pro Asp Ser Ser Ser Asn Ser Ala Ser 

« pi 70 75 

«« Gin Asp Thr Lys Glu Ser Ser Glu Glu Pro Pro Ser Glu Glu Ser Gin 

L 3 85 90 95 

|_g Asp Thr Pro He Tyr Thr Glu Phe Asp Glu Asp Phe Glu Glu Glu Pro 

iU 100 105 HO 

Thr Ser Pro lie Gly His Cys Val Ala He Tyr His Phe Glu Gly Ser 

p 115 120 125 

"» ?™ GlY Thr I1S SSr Met Ala Glu Glu As P Leu Ser Leu Met 

S „ "° 135 140 

S 1I5 Asp Lys g1y a so Gly Trp Thr Arg Val Arg Arg Lys Glu G1 y 

3 Glu Glv Tyr Val Pro Thr Ser Tyr Leu Arg Val Thr Leu Asn 

H 165 I 70 175 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 773 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LI-BRARY : PROSNOT2 0 

(B) CLONE: 1816529 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ATGAACCGTG CACCCTNCGA CAGCAGTCTG GGCACCCCCT ACGGATGGAC GGNCTGAACT 
CCGAGGNCCG GGTCGCAGCC GCACCAAGCG CTGGNCTTTT GGCAAGAAGA ACAAGACAGT 
GGTGACCGAG GATTTTAGCC ACTTGCCCCC AGAGCAGCAG CGAAAACGGC TTCAAclScI 

GTTGGAAGAA CGCAGTCGTG AACTTCAGAA GGAGGTTGAC CAGAGGGAAG CCCtISSS 240 

AATGAAGGAT GTCTATGAGA AG AC AC C TC A GATGGGGGAC CCCGCCAGCT TgS22£SJ 3uS 

GATCGCTGAA ACCCTGAGCA ACATTGAACG GCTGAAATTG GAAGTGCAGA AGTaJSgS 3 6^ 

GTGGC TGGC A GAAGCTGAAA GTCGAGTCCT TAGCAACCGG GGAGACAGCC TgSJSgS 420 

CGCCCGGCCT CCCGANCCCC CCGCTAGCGC CCCGCCAGAC AGCAGCAGCA ScgStc 480 

ACAGGACACC AAGGAGAGCT CTGAAGAGCC TCCCTCAGAA GAGAGCCAGG ACaSSIt tlo 

TTACACGGAG TTTGATGAGG ATTTCGAGGA GGAACCCACA TCCCCCATAG GTCACTGTGT fiOH 

GGCCATCTAC CACTTTGAAG GGTCCAGCGA GGGCACTATC tctatcgccg I^gtI^cI Ho 

CCTCAGTCTT ATGGAAGAAG ACAAAGGGGA CGGCTGGACC CgSSSgGC SSIgJSgS 720 

AGGCGAGGGC TACGTGCCCA CCTCCTACCT CCGAGTCACG CTCAATTGAA 773 
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(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 1255033 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
Lys He His Cys Phe Arg Ser Leu Lys Arg Gly Gly Val Thr Pro Glu 

Asp Phe Ser Asn Phe Pro Pro Glu Gin Arg Arg Lys Lys Leu Gin Gin 

20 25 30 

Lys Val Asp Asp Leu Asn Arg Glu He Gin Lys Glu Thr Asp Gin Arg 
^ 35 40 45 

U Asp Ala lie Thr Lys Met Lys Asp Val Tyr Leu Lys Asn Pro Gin Met 

% 9 on 55 60 

Gly Asp Pro Ala Ser Leu Asp Gin Lys Leu Thr Glu Val Thr Gin Asn 

^ He Glu Lys Leu Arg Leu Glu Ala Gin Lys Phe Glu Ala Trp Leu Ala 

Li 85 90 95 

G1U Val Glu <?lY Arg Leu Pro Ala Arg Ser Glu Gin Ala Arg Arg Gin 
IU 100 10 5 110 

ry Ser Gly Leu Tyr Asp Gly Gin Thr His Gin Thr Val Thr Asn Cys Ala 

115 120 12 5 

O ^ GlU Pr ° ASP GlY Ser Tyr Thr Glu Glu Gin Ser Gin 

135 140 

Glu Ser Glu His Lys Val Leu Ala Pro Asp Phe Asp Asp Glu Phe Asp 

150 155 

Asp Glu Glu Pro Leu Pro Ala He Gly Thr Cys Lys Ala Leu Tyr Thr 

Phe Glu Gly Gin Asn Glu Gly Thr He Ser Val Val Glu Gly Glu Thr 

180 185 190 

Leu Ser Val lie Glu Glu Asp Lys Gly Asp Gly Trp Thr Arg lie Arg 

195 200 205 

Arg Asn Glu Asp Glu Glu Gly Tyr Phe Pro Thr Ser Tyr Val Glu Val 

210 215 220 

Tyr Leu Asp Lys Asn Ala Lys Gly Ala Lys Thr Tyr He 
225 230 235 
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